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Abstract. We discuss some of the basic implications of recent results on galaxy correlations published by the 
SDSS collaboration. In particular we focus on the evidence which has been recently presented for the scale and 
nature of the transition to homogeneity in the galaxy distribution, and results which describe the dependence of 
clustering on luminosity. The two questions are in fact strictly entangled, as the stability of the measure of the 
amplitude of the correlation function depends on the scale at which the mean density becomes well defined. We 
note that the recent results which indicate the convergence to well defined homogeneity in a volume equivalent to 
Oh' that of a sphere of radius 70 Mpc/h, place in doubt previous detections of "luminosity bias" from measures of the 

amplitude of the correlation function. We emphasize that the way to resolve these issues is to first use, in volume 
' limited samples corresponding to different ranges of luminosity, the unnormalized two point statistics to establish 

the scale (and value) at which the mean density becomes well defined. We note also that the recent SDSS results 
for these statistics are in good agreement with those obtained by us through analyses of many previous samples, 
confirming in particular that the galaxy distribution is well described by a fractal dimension D « 2 up to a scale 
of at least 20 Mpc/h. We discuss critically the agreement of this new data with current theoretical models. 
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1. Introduction local universe up to a depth of several hundreds of Mpc. 

In this paper we discuss the results, and implications, of 
The most striking feature of the large scale distribution some recently published studies of the correlation proper- 
of galaxies is their organization in structures as clusters, ties of the galaxy distribution based on new data from the 
super-clusters and filaments around large volumes of space partially completed SDSS. 
empty of visible matter, the voids. Since the first obser- 
vations of the three-dimensional distribution of galaxies 
(Kirshner et al. 1983) evidence has accumulated for the 
existence of such large agglomerations of matter. In the 
eighties the "Great Wall" , a giant filament with an exten- 
sion of about 200 Mpc/h connecting many galaxy groups 
and clusters, was discovered (De Lapparent, Huchra & 
Geller 1989). Currently the largest known structure in 
the local universe is the recently discovered "Sloan Great 

Wall" (Gott et al. 2005), roughly twice longer than the An accurate statistical characterization of galaxy struc- 
Great Wall. Its discovery has been possible with new tures is evidently a key element for any physical theory 
data provided by the Sloan Digital Sky Survey (SDSS - explaining their origin. The earliest observational stud- 
York et al. 2000), one of the most ambitious observational ies, from angular catalogs, produced the primary result 
programs ever undertaken in astronomy. It will measure (Totsuji & Kihara 1969) that the reduced two-point corre- 
about 1 million redshifts, giving a complete mapping of the lotion function ^(r) = — 1 (where n is the density 
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of points) is well approximated, in the range of scales from 
about 0.1 Mpc/h to 10 Mpc/h, by a simple power-law 1 : 

w^y m 

with 7 w 1.8 and r ~ 4.7 Mpc/h. This result was sub- 
sequently confirmed by numerous other authors in dif- 
ferent redshift surveys (e.g., Davis & Peebles 1983; see 
Peebles 2001 for a recent review). However, while £_E(r) 
shows consistently a simple power-law behavior charac- 
terized by this exponent, there is very considerable varia- 
tion among samples, with different depths and luminosity 
cuts, in the measured amplitude of £e(?-). This variation 
is usually ascribed a posteriori to an intrinsic difference 
in the correlation properties of galaxies of different lumi- 
nosity — "luminosity bias" (see e.g., Davis et al. 1998): 
brighter galaxies present larger values of r$. Theoretically 
it is interpreted as a real physical phenomenon, as a man- 
ifestation of "biasing" (Kaiser 1984), or, more recently, in 
terms of "halo models" (see e.g. Cooray & Sheth 2002). 

Such a variation of the amplitude of the measured cor- 
relation function may, however, be explained, entirely or 
partially, as a finite-size effect i.e. as an artifact of sta- 
tistical analysis in finite samples. The explanation is as 
follows (see Gabrielli et al. 2004 — hereafter GSLJP 
for a detailed discussion, and original references). The re- 
duced correlation function £(r) can be written as £(r) = 
< ' n ^ >p — 1, where (n(r)) p is the conditional density of 
points i.e. the mean density of points in a spherical shell 
of radius r centered on a galaxy (the subscript p indicates 
the condition that the density is measured from an oc- 
cupied point). The latter is generally a very stable local 
quantity, the reliable estimation of which at a given scale r 
requires only a sample large enough to allow a reasonable 
number of independent estimates of the density in a shell. 
The mean density (n), on the other hand, is a global quan- 
tity. The size of a sample in which it is estimated reliably 
is not known a priori, but depends on the properties of 
the underlying distribution. Specifically the sample must 
be large enough so that the mean density estimated in 
it has a sufficiently small fluctuation with respect to the 
true asymptotic average density. If (r) and ^ (r) are 
estimates of £(r) in two samples with, respectively, mean 
densities and nfg \ one has 

$\r)*>*%rg\r) (2) 

in the regime where £^ (r), £^ (r) 3> 1. A linear ampli- 
fication of £e(?*) in more luminous samples will thus be 
observed if, for example, the fainter galaxy samples probe 
volumes smaller than those at which the mean density be- 
comes well defined in the underlying distribution. There is 
in fact such a systematic effect in observations, as fainter 
galaxies can only be seen at smaller distances. 

1 The subscript E indicates the statistical estimator of £(r) 
in a given sample 



3. Finite size effects in the measurements of 
galaxy correlations 

It has been pointed out by Pietronero (1987) that, when 
analyzing a point distribution which, like the galaxy distri- 
bution, is characterized by large fluctuations, one should, 
in fact, first establish the existence of a well defined mean 
density (and ultimately the scale at which it becomes well 
defined and independent of the sample size, if it does) 
before a statistic like £(r), which measures fluctuations 
with respect to such a mean density, is employed. Further 
the existence of power-law correlations, which are clearly 
present in the galaxy distribution, is typical of fractal dis- 
tributions, which are asymptotically empty. In such dis- 
tributions the mean density is always strongly sample de- 
pendent, with an average value decreasing as a function 
of sample size. Given the observation of such correlations 
in the system, and the instability of the amplitude of the 
correlation function £_E(r) estimated in different samples, 
special care should be taken in establishing first the scale 
(if any) at which homogeneity becomes a good approxi- 
mation. The simplest way to do this is in fact to measure 
the conditional density (n(r)) p , or, alternatively, (n*(r)} , 
the integrated conditional density, where n*(r) represents 
the density in a sphere of radius r about an occupied 
point. These quantities are generally well defined, and give 
a characterization of the two-point correlation properties 
of the distribution, irrespective of whether the underlying 
distribution has a well defined mean density or not. A sim- 
ple power law behavior (n(r)) p = Br~^ is characteristic 
of scale-invariant fractal distributions, with the exponent 
7 < 3 giving the fractal dimension through D = 3 — 7. 
The pre-factor B is, in this case, simply related to the 
lower cut-off of the distribution (GSLJP) 2 . If the distri- 
bution has a well defined mean density, one has, asymp- 
totically, (n(r)) = (n*(r)) = const. > (i.e., D = 3 
in the previous formula). Measurement of these quantities 
can thus both characterize (i) the regime of strong clus- 
tering and (ii) the scale and nature of a transition to ho- 
mogeneity. Only once the existence of an average density 
within the sample size is established in this manner does 
it make sense to use £(r). And, in that case, the analysis 
with (n(r)} can determine the minimal size of samples 
required to make the use of this quantity meaningful. 

In studies of galaxy data from numerous different sur- 
veys using the conditional density (see GSLJP and ref- 
erences therein, in particular Joyce, Montuori & Sylos 
Labini 1999) it has been found that, at scales up to roughly 
20 Mpc/h, where the statistics are very robust (i.e. where 
a reasonable number of independent spherical shells can 
be fully inscribed in the sample volume), the galaxy dis- 

2 We remark that in nature any fractal distribution has a 
lower and an upper cut-off between which the characteristic 
scaling proprieties are manifested: the infinite volume limit is, 
of course, simply a useful mathematical idealization. In par- 
ticular the hypothesis that the galaxy distribution manifests 
fractal scaling always refers implicitly to some finite range of 
(observable) scales. 
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tribution is well described by a simple fractal scaling with 
D w 2. At larger scales (up to <~ 100 Mpc/h, or even 
greater) it has been argued on the basis of these analyses 
that there is weaker statistical evidence for the continu- 
ation of a fractal scaling, and no clear evidence for ho- 
mogeneity. These analyses not only place in question the 
physical meaning, of the amplitude of £(r), as discussed 
above, but also produce a different value of the exponent 
characterizing galaxy clustering at small scales 3 . Indeed, 
as £(r) w (n(r)) p at the scale where £(r) > 1, the two 
statistics should agree at these scales. The discrepancy can 
be explained (see GSLJP) as a result of the way in which 
the dimension D is normally estimated from £e(?*): it is 
determined by fitting a power-law in a log-log plot around 
the scale r defined by ^e(to) = 1- If there is a strong 
break from simple power-law behavior in £ (r) around this 
scale the estimated dimension D' = 3— 7' is systematically 
smaller than D = 3 — 7 (e.g. with £(r) = (ro/r) 7 e~ r//r ° one 
has 7' = — d(ln£)/d(lnr)\ r = ro =7+1). For example one 
may note that Hawkins et al. (2003) by fitting £e (r) in the 
regime r < r$ find that 7 s=s 0.75, while at scales of order 
ro they measure 7 w 1.6. On the other hand Tikhonov 
et al. (2003) measured a value of about 7 = 0.6 -f- 0.8 by 
studying the conditional density in different galaxy sam- 
ples; in addition they found evidences for a crossover to 
homogeneity for scales large than 100 Mpc/h. Similar re- 
sults were found by Baryshev & Bukhmastova (2004) in 
some early SDSS samples, by considering the two-point 
conditional density. 

4. New SDSS results on galaxy correlations 

In HEB3GS the integral conditional density (n*(r)} p has 
been estimated in the "luminous red galaxy" (LRG) sam- 
ple of the SDSS survey. It is an approximately volume lim- 
ited (VL) sample with very precise photometric calibra- 
tion, and is by far the largest such sample ever considered 
for such an analysis. Its size allows the robust estimation 
of this statistic up to a scale of order 100 Mpc/h, as at 
this scale the sample contains a considerable number of 
independent (non-overlapping) spheres centered on galax- 
ies. The results can be summarized as follows: (i) A sim- 
ple power-law scaling corresponding to a fractal dimension 
D = 2 gives a very good fit to the data up to at least 20 
Mpc/h, over approximately a decade in scale; (ii) at larger 
scales (n*(r)) continues to decrease, but less rapidly, un- 
til about ~ 70 Mpc/h, above which it flattens up to the 
largest scale probed by the sample (100 Mpc/h). The tran- 
sition between the two regimes is slow, in the sense that 
the integrated conditional density at ~ 20 Mpc/h is about 
twice the asymptotic mean density. 

3 Note that we are discussing here the results of measures of 
£(r) and (n(r)} p in redshift space. Such studies (e.g. Park et al. 
1994, Benoist et al. 1996) have consistently reported a dimen- 
sion D = 1.2 — 1.5. Note also that finite size effects perturb 
the estimations of the correlation exponent differently for the 
correlation function and the power spectrum (see Sylos Labini 
& Amendola 1996 for more details). 



Let us consider the conclusions which can be drawn 
from this analysis in view of the discussion in the previ- 
ous section. Firstly, the results are highly consistent with 
the claim that galaxy correlations, up to approximately 20 
Mpc/h, are well characterized by a simple power-law scal- 
ing corresponding to a fractal dimension D w 2. The LRG 
sample probes the scaling of (n*(r)) only from about 3 
Mpc/h, and evidently only for the brighter galaxies repre- 
sented by this sample. The same exponent has, however, 
been observed to describe well the behavior of (ra*(r)) 
by our analysis of many precedent galaxy samples over a 
large range of luminosities, and down to scales an order of 
magnitude smaller (see GJSLP and references therein). 

Secondly, the analysis provides good statistical evi- 
dence for homogeneity at larger scales, with a flattening 
of (n*(r)) apparent above Ao w 70 Mpc/h. Future data, 
and in particular the even larger forthcoming samples from 
SDSS, will confirm or refute this very important result, 
extending the range of scale between the detected Ao and 
the sample size 4 . If one takes this scale to be a reliable 
determination of the scale of homogeneity, it implies that 
£(r) is a well defined statistical quantity. Further one can 
infer the size of samples in which one can usefully employ 
this statistic to characterize the statistical properties of 
the galaxy distribution. Here "usefully" means that the 
finite size effects which can systematically offset the am- 
plitude of the estimator £g(r) from that of £(r) are under 
control, so that a physical significance can be attributed 
to this amplitude (to a degree of accuracy which can be 
determined). For this to be the case one requires that the 
sample be large enough so that (i) the sample mean den- 
sity approximates sufficiently well to the true asymptotic 
mean density, and (ii) the sample to sample fluctuations 
(i.e. variance) in the density are sufficiently small. The 
requirement on the sample size imposed by (i) is deter- 
mined by comparing the conditional density at the depth 
of the sample with the measured asymptotic density, and 
the results of HEB3GS indicate that a sample of depth 
greater than w 70 Mpc/h is sufficient to make such sys- 
tematic effects small. To determine the constraint imposed 
by (ii) one needs a measure of fluctuations in the density in 
such a sample about the (now well defined) mean density. 
HEB3GS provides, beyond the measure of the conditional 
density (which gives information about the average and 
not the fluctuations), one measure of such fluctuations 
in the LRG sample: for 0.2 < z < 0.35 in ten disjoint 
regions with volumes V w 2.2 x 10 7 (Mpc/h) 3 field-to- 

4 We remark, in particular, that in HEB3GS the density in 
each sphere is not actually calculated by dividing the number 
of points by its volume. Instead the division is by the number 
of points in the same volume in an artificial catalog, which is 
constructed to correct for effects leading to incompleteness of 
the sample. This actually involves building in the assumption 
of homogeneity at large scales. It is stated that the sample 
is sufficiently complete that this is "essentially equivalent" to 
dividing by the volume. The dependence of the results on this 
difference should however be quantified carefully and controlled 
for in future analyses. 
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field fluctuations are measured to be v , , — w 0.07, after 
subtraction of a Poisson noise term. Such fluctuations, in 
general, depend not only on the volume of the region, but 
also on its geometry. However, if the sample is sufficiently 
large to include several spheres of radius of order Ao , we 
can suppose such an equivalence. Using the estimations of 
HEB3GS we would thus expect that, to reduce the finite 
size fluctuations in the amplitude of £e (r) to of order ten 
percent, we need (i) samples including complete spheres 
of radius A w 70 Mpc/h, and (ii) an equivalent depth at 

least R e ss (^f) 1/3 175 Mpc/h (using = V). 

Let us consider how these values for Ao and R e com- 
pare with those of redshift surveys prior to SDSS: in the 
2dF survey (e.g. Peacock et al. 2001), which has VL w 0.05 
sr, the largest enclosed sphere in a VL sample is of radius 
~ 30 Mpc/h and the greatest depth of such a sample is 
considerably smaller than R e « 450 Mpc/h; in the ESP 
catalog (Guzzo et al. 1999) the radius of the largest en- 
closed sphere is ~ 20 Mpc/h, while the largest VL sample 
has a volume approximately one hundred times smaller 
than that corresponding in the LRG sample to fluctua- 
tions of order 10% in the density. The determinations of 
HEB3GS imply that these, and indeed all, previous mea- 
sures of £(r) in redshift catalogs prior to SDSS are ex- 
pected to be severely affected by the finite size effects we 
have discussed. 

5. Implications with respect to other results from 
the SDSS collaboration 

In another recent paper (Zehavi et al.2004A) the SDSS 
collaboration reports a study of the correlation proper- 
ties of a sample of 200,000 galaxies, covering a solid angle 
of about 47r/16, with the standard method of £(r) estima- 
tion. The paper focuses on the dependence of this quantity 
on galaxy properties, finding in particular that the ampli- 
tude of the measured two-point correlation function rises 
continuously with absolute magnitude from M r w —17.5 
to M r rts —22.5, with the most rapid increase occurring 
above the characteristic luminosity (M r w —20.5). The 
scale ro (determined by £e(r) = 1) varies in the range 
[2.7, 10.0] Mpc/h 5 . 

As we have discussed above, the results of HEB3GS 
indicate that one expects finite size effects in the ampli- 
tude of £e(7") to be reduced to of order ten percent un- 
der two conditions: the samples (i) enclose fully a spheres 
of radius w 70 Mpc/h, and (ii) have at least a depth 

R e w (If-) 1 ^ 3 170Mpc/h. These criteria are in fact sat- 
isfied only by the largest VL samples considered in Zehavi 
et al. (2004A) , and so there is a priori evidence that such 

'' The paper actually follows what has become a standard 
practice in the last few years: results are given only for the 
"projected real space" correlation function, rather than the 
redshift space correlation function, which we have considered 
here. This makes it much more difficult to compare the results 
for the estimated dimension, and so we will not focus on this 
point here. 



finite size effects, and not a real physical effect, may be, 
wholly or partially, responsible for the measured variation 
in tq. Zehavi et al.(2004A) do actually consider indirectly 
this possibility, and report a test of this hypothesis in 
which the correlation functions are measured in adjacent 
luminosity samples cut to cover the same spatial volumes. 
Of the four pairs of samples there is reasonable stability in 
only one case, in the three others the result clearly favors 
the hypothesis of volume dependence. The authors argue 
that their results indicate the opposite conclusion, and as- 
cribe the observed variation between the different volumes 
a posteriori to "anomalous" fluctuations of the density in 
specific samples due to the presence of the "Sloan Great 
Wall" . Rather than being anomalous, such fluctuations are 
simply indicative of the intrinsic correlations of the galaxy 
distribution. 

In a subsequent paper by the SDSS collaboration 
(Zehavi et al. 2004B) a standard £(r) analysis is applied to 
the same LRG sample of galaxies considered in HEB3GS. 
In this case HEB3GS provides a justification for the use 
of this statistic, as it has shown that the sample does con- 
tain many independent spheres with radius larger than 
that at which the conditional density flattens. What is 
found in Zehavi et al.(2004B) is that £_E(r) shows a con- 
siderable degree of stability in different samples. In par- 
ticular the full correlation function of the LRG galaxies in 
redshift space is, within the estimated error bars, in very 
good agreement with that of the most luminous galax- 
ies (i.e. largest volume samples) measured in Zehavi et 
al. (2004 A). This appears to give a quite robust deter- 
mination of the scale r ~ 13 Mpc/h 6 , the stability of 
which it will be interesting to see confirmed by forthcom- 
ing larger samples of SDSS. On breaking the LRG sample 
into three sub-samples by luminosity, however, Zehavi et 
al.(2004B) finds some variation in the amplitude of ^(r), 
which is attributed again to "luminosity bias" . The vari- 
ation is however at a level of only 20% in the amplitude, 
compared to a factor of almost ten between the brightest 
and faintest sample in Zehavi et al.(2004A). This is at a 
comparable level to the fluctuations in the mean density 
found at the scale of the sample in HEB3GS and so this 
"detection" of luminosity bias appears highly question- 
able. Indeed Zehavi et al. (2004B) measure £e(t") in three 
different subsamples of the LRG sample, corresponding 
to three redshift bins, and find variations in amplitude at 
the same level. In this case they conclude that this vari- 
ation is "likely to reflect large-scale structure variations" 
i.e. precisely the finite size effects we are discussing. 

To clearly determine whether the observed variation 
in amplitude is a manifestation of a real difference be- 
tween galaxies of different luminosity, or a volume effect, 
for which we have argued there is a better prima facie 
case, requires, we believe, a much more systematic statis- 
tical analysis. The simplest way to perform such an anal- 
ysis is via the conditional density (n(r)) (or alternatively 

This value (in redshift space) can be inferred from the data 
given in Table 2 of Zehavi et al. (2004B). 
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via (n*(r)) ), performed on galaxies with different prop- 
erties (and, in particular, luminosities) in VL samples. By 
doing so, one obtains both a characterization of the two- 
point correlation properties (and in particular exponent or 
exponents) characterizing the clustering in the strongly 
clustered regime, and determines the scale at which ho- 
mogeneity is established. A fundamental question is then 
whether galaxies of different luminosity show the same be- 
havior of (n(r)) p , and, more specifically, whether they are 
characterized by the same scaling dimension in the regime 
of power-law correlations, and the same homogeneity scale 
Aq . The considerable amount of data we have analyzed in 
the past (for discussion and references, see GSLJP) in- 
dicated that (n(r)) p has approximately the same simple 
power-law behavior D ~ 2 in the ranges of scales probed, 
with only a very weak dependence on luminosity. Once 
such a systematic analysis has been performed, and the 
homogeneity scale (or scales) established, a £(r) analysis 
can be used to characterize correlation in the regime of 
small amplitude fluctuations. Note that a linear amplifi- 
cation of £(r), with no distortion in the range £(r) > 1, as 
a function of luminosity would correspond in the (n(r)) p 
analysis to an observed dependence of the homogeneity 
scale, Ao, on galaxy luminosity. 

6. Theoretical Implications & Open Questions 

We conclude with some remarks on other related observa- 
tional and theoretical questions about galaxy correlations. 

We have underlined that the results of HEB3GS for 
the conditional density confirm the characterization, to a 
very good approximation, of the galaxy distribution by a 
single exponent D ss 2 up to 20 Mpc/h. The most straight- 
forward interpretation of such an observation is that it 
indicates a scale-invariant fractal behavior over the corre- 
sponding range of scale. However, other explanations are 
possible which invoke an underlying distribution which is 
not fractal (see GSLJP, and references therein) . One such 
possibility is a distribution of spherically symmetric den- 
sity profiles described by a rapidly decaying power-law 
about their centers. Indeed this corresponds to what is 
observed in cosmological N body simulations of dark mat- 
ter and used in phenomenological models ( "halo models" , 
see Cooray & Sheth 2002) for the formation of structure 
by gravity. One can perform further statistical tests to 
differentiate the two qualitatively different distributions, 
notably by more direct determinations of the dimension 
by box-counting methods, or of the behavior of the con- 
ditional variance as a function of scale (Bottaccio et al. 
2004; GSLJP). Results obtained by some of us (Coleman 
& Pietronero 1992; Sylos Labini, Montuori & Pietronero 
1998) support the fractal interpretation, but with a lim- 
ited statistical confidence. With the data now becoming 
available from SDSS it should be possible to determine 
the answer to this basic question about the nature of the 
galaxy distribution more definitively. 

Beyond this specific basic question about the nature of 
the galaxy distribution at scales well below the homogene- 




r (Mpc/h) 

Fig. 1. Behavior of (n(r)) p for "CDM particles" in a 
ACDM simulation run by the Virgo consortium (Jenkins 
et al. 1998) together with the determination by HEB3GS 
and those in several samples of the CfA2 catalog reported 
in Joyce, Montuori & Sylos Labini (1999). Note that (i) 
the power-law index in the simulated data is quite differ- 
ent to that in the observations (7 ~ 1.8 instead of 7 ~ 1 - 
Baertschiger, Joyce & Sylos Labini 2002); (ii) the homo- 
geneity scale detected in the LRG sample, A ~ 70Mpc/h, 
is very significantly larger than that in the simulated data 
(Ao ~ 10 Mpc/h). See text for discussion. 

ity scale, i.e. in the regime of strong fluctuations, a more 
general question concerns the agreement between current 
cosmological models and the data on the galaxy distribu- 
tion over all scales probed by current data. HEB3GS states 
that the correlation properties of LRG galaxies measured 
are compatible with those predicted in a standard ACDM 
model, with a "bias factor" of order 2. In this paper the 
evidence for this conclusion comes solely from the compar- 
ison, which gives the quoted bias factor, of the amplitude 
of the measured fluctuations in the volume equivalent to 
a sphere of radius 175 Mpc/h with the prediction of this 
theoretical model, rather than from a comparison of the 
model with the data over a range of scales. In FigQwe 
show on a single plot the results of HEB3GS and the in- 
tegrated conditional density calculated in a large ACDM 
simulation of dark matter. The theoretical (n*(r)) of the 
dark matter shows an approximately power-law behavior, 
characterized by an exponent 7 ~ 1.8, followed by a rapid 
cross-over to homogeneity at a scale Ao » 10 Mpc/h. Also 
shown are the results from the analysis of the CfA2 survey 
reported in Joyce, Montuori & Sylos Labini (1999). 

What is implicitly assumed by HEB3GS is that the 
formation of LRGs can be described as a sampling proce- 
dure on the underlying dark matter, which leads, on the 
one hand, to a linear amplification of the fluctuations by 
a factor of 2 at very large scales, while also producing 
the correlation properties observed in the LRGs from the 
very different ones of the underlying dark matter field. 
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So-called "halo models" (see e.g. Cooray & Sheth 2002) 
currently provide the framework to describe such a sam- 
pling: the distribution of dark matter is described as a set 
of halos, to which formation probabilities for the different 
kinds of objects are ascribed a posteriori to produce the 
observed correlation properties. Given that the properties 
of the two distributions are so different, we believe it is 
fair to say that the statement that these observations ac- 
cord with the theory is very weak: any set of observations 
would probably accord with theory given this criteria. 

Given that the notion of "bias" , which has its origin in 
the measurements of variations in the amplitude in CeMj 
is so central in these theoretical constructions, it is evi- 
dently crucial to correctly understand the role of finite size 
effects in determinations of amplitudes of correlation func- 
tions, using the method we have outlined. Only once this 
is done can the theoretical problem in the regime of strong 
fluctuations be properly addressed. The simplicity of the 
correlation properties of visible matter over a range of two 
decades in scale — well described by a simple power-law 
scaling from ~ 0.1 Mpc/h to approximately 20 Mpc/h — 
cannot but suggest that the relegation by current theories 
of visible matter to an epiphenomenon of the underlying 
dark matter may be mistaken. 

On larger scales, where fluctuations become small, the 
comparison between theory and data becomes in principle 
much cleaner, as the theoretical description of this regime 
becomes simpler. Measurement of the signal, which is of 
very low amplitude, becomes, however, much more dif- 
ficult, and indeed it is only with data from SDSS now 
emerging that it is becoming feasible to make such com- 
parisons. Indeed a very recent paper published by the 
SDSS collaboration (Eisenstein et al. 2005) give an anal- 
ysis of galaxy correlations in this regime and claims a 
statistically significant detection of a "bump-like" feature 
in the real space correlation function £(r) at the loca- 
tion predicted by the same ACDM model considered in 
HEB3GS (for a discussion of these real space properties 
of standard models, see GSLJP, Chap. 6). Evidently such 
findings, if confirmed by other galaxy data, provide dra- 
matic evidence in favor of the standard model of struc- 
ture formation. We note (Gabrielli Joyce & Sylos Labini 
2002; GJSLP) that a very characteristic prediction in this 
regime of standard models is the existence of a negative 
power-law tail with £(r) ~ — 1/r 4 at the very largest 
scales which the SDSS survey is now beginning to probe. 
Such a behavior is the direct signal of the "primordial" 
form of the fluctuations in standard cosmological models, 
with a power spectrum which is linear in k at small k. 
Interestingly the simple behavior of the power spectrum 
is destroyed by the "sampling" of the CDM field (i.e. by a 
prescription for how to infer galaxy correlations from those 
of dark matter), while that of the real space correlation 
function remains intact (Durrer et al. 2003; GJSLP). This 
behavior can be understood simply in terms of the very 
specific "super-homogeneous" properties of the primordial 
fluctuation field, which are also encountered (Gabrielli et 



al. 2003) in certain systems widely studied in statistical 
physics. 
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